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THE EXPERIMENTAL GROUP IN THIS STUDY WAS 112 
KINDERGARTEN CHILDREN FROM 11 HEAD START CENTERS. IN ORDER TO 
ASSESS THE VALUE OF THE HEAD START PROGRAM, THE MEASUREMENT 
OF THE EXPERIMENTAL GROUP TAKEN DURING THE FALL WAS COMPARED 
TO A MEASUREMENT OF NON-HEAD START KINDERGARTEN CHILDREN 
TESTED ABOUT THE SAME TIME. AT LEAST FOUR MONTHS AFTER THE 
PRETESTING, POSTTESTS WERE ADMINISTERED. THE PRETEST BATTERY 
CONSISTED OF THE STANFORD -BINET CS-B) ANC TWO FORMS OF THE 
PEABODY PICTURE VOCABULARY TEST CPPVT) . THE POSTTEST BATTERY 
CONSISTED OF (1) S-B* (2) PPVT, BOTH FORMS, (3) THE BEHAVIOR 
INVENTORY, AND C4) THE METROPOLITAN READING READINESS TEST. 

THE MOST SIGNIFICANT RESULTS WERE Cl) NO DIFFERENCES WERE 
FOUND BETWEEN THE SCORES OF THE EXPERIMENTAL GROUP AND THE 
SCORES OF THE CONTROL GROUP, C2> THE CHILDREN SHOWED 
SIGNIFICANT IMPROVEMENT BETWEEN PRETEST AND POSTTEST PERIODS 
ONLY ON THE RECEPTIVE FORK OF THE PPVT, (3) TCC CHILDREN 
CONSISTENTLY DEMONSTRATED HIGHER SCORES ON THE S-B THAN ON 
THE RECEPTIVE PPVT, AND (4) THE PERFORMANCE OF THE 
KINDERGARTEN CHILDREN APPEARED TO BE AFFECTED BY WHICH SCHOOL 
THEY ATTENDED AND WHICH TEACHER PRESIDED IN THE CLASSROOM. 
ALSO 160 HEAD START CHILDREN, WHO WERE TOO YOUNG TO START 
KINDERGARTEN, WERE TESTED IN THEIR HOMES AND WILL BE USED FOR 
A FOLLOWUP STUDY. (WD) 
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Introduetioiis The luiture and eeope of the ftohXeii* 

A folloif«>up etiidy of e Heed Start poptilatioa ispliee an eeeloetloii 
of pupils y progrsMS or soee conbluatloa of both* The designer of such 
a study has an option to exanine those irariahles in macroscopic or 
microscopic fashion in the sense that he can choose to examine for the 
presence or absence of effects in grossly dissimilar populatloes (axperi* 
mental and control groups), or he can choose to relate particular hinds 
of effects to particular kinds of programs and pupils* The latter design 
is capable of giving more powerful ansvers to the <|t*sstions raised by 
the theoreticians and experimentalists, but it can be rather restricted 
in its capacity to generalise* The precise design and, careful sampling 
required of the microscopic approach dictate a need for exttamaly large 
populations to fill the several cells which would result if meaningful 
generalisation is a goal of the investigation* Thus, only a study of 
national scope could hope to fill the cells corresponding to a good sample 
of programs, teachers, snd pupils* hack of a ^population of this magnitude 
drastically inhibits the generali*.ability of any microacopic study* 

Macroscopic studies may allow for broad generalisation, but the st»« 
stance of that which is generalised Is too often or little signiflcsnce* 
Simply searching for the presence or absence of an effect may combine 
gains and loaaes into aema changes that obscure the real dynamics undar* 
lying the program* A large uni-dimenaional program may be examined 
meaningfully in this manner, but such a program cannot be lound in educa<» 
tional fields and Is certainly not ehsrscteristic of Mead Start programn* 
The goal of the present investigation is to effect e compromise 
between those two approaches* We would like to achieve scam degree of 
generslissbility; in this esse we mean a capacity to gar^rallaa to tha 
Haad Start program and children of Washington, D«C* To tha axtant that 

Washington is repraammtative of urban centers providing Haad Start to 

the Children of the ghetto, one would be in a position to make fuifther 
atatementa about theaa kinds of programs* It is not, hsmauag* ouir 

intantion to make this leap, ao that we will restrict ouraelvaa to simply 

the Washington situstlon* Consequently, our ssmpling wee far rasging amd 
thin, although with certain raatriciton intrinsic to Washington* Thaaa 
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will be aiseoesed below, but for the present an exanple can be given* 

Our population attended autuBer 1965 Head Start which waa Jointly funded 
by OEO and Title t, ESEA. Consequently, each of the 48 Centers eatab- 
lished by the Washington, i>* C. Board of Education for the suOMer program 
was located in an elementdry school designated as a Title 1 target school* 
This meant that a very large majority of the pupils attending the school 
were from families meeting the Title I poverty criteria* It also meant 
that the n€lghbor5«ood serviced by that elementary sciiool Is, from the 
socio-economic standpoint, very low and homogeneous* Finally, It means 
that the conmuntty and the school Is better than 98% Negro in composition* 

It was our intention to identify the sample in Head Start and follow them 
in the kindergarten offered in the same school. The eleven sample Centers 
had a total enrollment of approximately 900 children during the summer* 

A search of the kindergarten classes offered by these same schools and In 
which the Head Start sample children should have been enrolled, revealed 
only 340 Head Start graduates. Clearly there Is a great deal of transiency 
of some kind in this population. Our generalisations are therefore, 
restricted to a population somewhat more stable thms the majority of the 
Head Start population. The control group was also selected from the 
kindergarten enrollment, but their rate of transiency can only be guessed. 

If their rate of transiency is the same as the rate of the Head Start 
population, then it can be assumed that this control group Is made up of 
about one third stable families and two thirds transients who came from 
other sections of the city. Thus, our control population may ba somewhat 
less stable than the Head Start population. It la unlikely that the control 
group has a lower rate of transiency than the Head Start population so 
the least that can be said about them is that they have either the smue 
degree of family transiency or s greater degree of transiency than the 
Head Start population. Although this will reduce the internal validity 
of the present design to some extent, :he major Impact of these facts Is 
in restricted generalisations. We can draw conclusions only about Head 
Start children who remain In the neighborhood at least long enough to 
attend kindergarten In the same building In which they attended Head Start. 

Cenerallaabillty is not, however, the Sily goal. With a large 
enough population, we could seek to relate the variations in output with 
















several kinds of independent measures* Those measures of greatest 
interest include the nature of family structure from whence the 
child comes, the nature of the Head Start teacher *s behavior toward 
the child, and the program means and goals guiding the teacher* On the 
dependent side, we would be interested in cognitive-verbal measures, 
skills in dealing with stress producing situations such as test-taking, 
social-emotional measures, and academic achievement in public schools 
following the Head Start experience. 

However, the examination of seveiral of these microscopic variables 
in the context of a study large enough to allow for Interesting generali- 
sations would require an extensive sample and research team* The 
compromise arrived at in the present investigation was to restrict the 
measurement to the least number of theoretically important variables and 
maintain the largest sample possible* Tn order to achieve this goal several 

categories of variables were excluded from this study. First, all measures of the 

family were excluded, except for a sampling of the socio-economic status* 

The reason for this is that Interview material and survey research methods 
yield little of value In predicting behavior of preschoolers. Observation 
of parent-child interactions, and other family structure variables, while 
potentially of much greater value in such predictions, are extremely 
cumbersome and therefore quite inappropriate to relatively large scale 
data collection projects. Failure to measure these extremely Import^int 
variables in a follow-up study such as this must result In heightenc;d 
within-group (error) variances and a consequent obscuring of the true 
differences between groups. This is the price to pay for generallzabillty* 

Since it is known that the families living In the neighborhoods 
serviced by the sample schools are quite homogeneously low socio-econom- 
icaliy, a sample of these families was taken to confirm the nature of our 
experimental sample. Our sample does not deviate from the kiiown character- 
istics of the neighborhoods, so that we can accept their status as described 
by the Board of Education* 

The next category of variables excluded was that of program and 
educational goals* A large city school system will of necessity have a 
great variety of teachers translating the standard curriculum in a variety 
of ways* Rather than searching for curriculum similarities and differences, 
it was decided that a more powerful source of variance in child behavior 
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it a deacirlptioti of the teacheir*s actual behavior toirardi the child* 

Since » aa we will describe below» we are liitereated In the vettbal 
behavior of the child, a measure of verbal Interaction between teacher 
and child was chosen as an Indicator of the lnj>ut dlnenalon of the Head 
Start program. Unfortunately, a major compllcatlcm In the orderly 
process of this investigation occurred, re<suirlng a shift in the 
research design. 

This project was funded by OEO during the spring of 1966 , and 
preparation for work was begun at that tifie. However, because of 
certain complicated fiscal arrangeviients between the granting agency 
and the University, funds for the work were not released for expendi- 
ture until the last week of Head Start operation at the end of the 
summer* Personnel to be trained for Observation could not be hired 
until that date, nor could selection of the sample* Consequently, the 
complete set of measurements on the total sample were taken after the 
Head Start experience* It was for this reason that the sample was 
selected from the population of Head Start graduates attending kinder- 
garten, rather than from the population in attendance during the 
summer program* Since our pretest measures were taken during the fall ^ 

of 1966 (and since it was not possible to observe pupil teecher Interec- 
tlon in the kindergarten class) , this study Includes no meaaurea of the 
educational input to which dependent measures might be referred* Our 
attempt to be representative of the Head Start classes in our snap ling 
does, however, ellew us to speak of the effects of the kind of program 
a large city developed for Its summer classes, on the children of the ^ 

lower Income groups who remained In their neighborhoods to attend 1 

kindergarten In the stme schools* 

On the dependent side, our interests were In estimetini the nature 
end extent of change in the Head Start chlldU skill In dealing with 
situations of personal evaluation* We reasoned that s valid reflection 
of the reduced sense of self worth that Is assumed to be a part of the 
child of poverty, particularly as expressed In the tendency to school 
fsiluret Is the reaction the child has to being tested* A reduced sense 
of oelf worth means, at least logically If not psychologically, an 










anticipation of failure and a consequent heightened fear of testing 
situations. It was our intention to assess the general level of 
anxiety displayed by the child in the Head Start situation and the 
specific anxiety displayed in the testing situation. Since we 
were unable carry out i^ny observations of the child in either 
the Head Start class or the kindergarten class » our estiwates of 
these properties of our sample are restricted to data collected in 
the testing situation and to judgments made by the testers and 
teachers. Several methods were devised to utilise standard test 
scores as estimates of test anxieties. 

In any such follow<*up study » a cont:;ol group design is a 
desirable » but not necessary approach. Hatching control and 
experimental groupa has such serious problems that Campbell and 
Stanley for example ( 1966 ) indicate a strong ptefereace for 
randoiiilxed procedures and abandonment of matching whenever poseible. 

In the present instance* randomization was not possible since we could 
have no control over the admissions procedures of the Bead ftart staff, 
further* since it is not possible to measure any of the non^-child 
Independent variables (family structure* educationel iafut or pre- 
kindergarten experience of the control group), matching could occur 
on a restricted set of variables. We could ensure that the control 
and experimental eiiild cane from the time neighborhood «id socio-economic 
level as the experimental child by matching for kindergarten class* This 
would at the same time ensure roughly comparable kindergarten experiences 
Age and sex could also be matched. However, it is not poesible to matdi 
or measure the children on dependent measures prior to the Heed Start 
experience, the logic of comparison between experimentel amd control 
is somewhat impaired by this situation, although patterns of similari- 
ties and differsncea between these groups might very well compensate 
for unmeasured differences between them prior to the experimentel treet- 
ment. In fact, a comparison of the kind of changes which teke place 
over time within each of tlie two groups, can be reesonid>ly made without 
reference to the beginning level of performance for either of them, 
the gueetion to be asked of the diita under these conditions is: idiat 

is the impact of kindergarten experience for children with and without 



He«l Start aacperlence? Clearly* It la helpful* hut not tteeei»aary 
to have experlaetital and c<witrol children aatched >n dependenc 
neaaurea at the beginning of Head Start, If thle question Is asked* 

The sample. 

The total enrollment roster of 7500 children for the s«*mer 
1966 Head Start program was obtained from the Board of Education. 
These chlldtv^n were distributed across 48 Centers enrolling from 
fewer than 50 to more than 200 children in each. The Centers were 
located in ptd>llc school buildings in the target areas designated 
by the Title X, BSEA criteria. Children from the Immediate 

neighborhood attended each school. 

k smiple of eleven Centers were selected from the total to 
reflect the dlatrihutlon of the number of classes in ea^di Center. 
Thus the sample Centers had the same distribution of site of 



classes as the total sample. The 1775 children enrolled in these 
Centers coostltuted the pool of experimental subjects for 
this study. This smaple of children also reflected the age 
distribution of the children in the total population. If we divide 
the total population Into an older group (those who would be five 
years old by Hov. 1, 1966 and therefore eligible to enter klndergart^ 
after Head Start)* and a younger group (those too young to be mllgibie 
to enter kindergarten in the fall following thair Head Start experlenca) , 
the older group conatitute 34% of the total enrollment and the 
younger 46%. In the sanple Centera, the older group conatituted 53% 
and the younger 47% of €mrollees. 

Older group 

In order to select the simple of experimental children* it was 
necessary to identify those who actually entered kindergarten. It 
was expected that many children would move out of the city before 
school begin* thst wut&y would not attend kindergarten in the aame 
neighborhood school in which their Head Start class was bald* and that 
many would not attend kindergarten for a variety of reiaone. This was, 
in fact* the case beceuae after careful exsminatiott of the kindergarten 
Claeses into which tlie children of the eleven experimentel Centers 
should have been enrolled* only 340 Head Start children were identified. 















till® group then constituted the pool of subjects irosi iihlch the 
ssiple was selected* It Is not possible to determine where the other 
Head Start children are without exaaiining the enrollnent lists In 
each of tile 140 elementary schools In the District of Coliariila* 

Our expermental sample Is limited simply to those diildren who are 
now enrolled in the kindergartens serving the neighborhoods In which 
they live and In which their Head Start class was located* It is 
not possibie to estimate the similarity or dissimilarity between 
those Head Start children who did get into our sample md those who 
did not because the latter cculd not be found. Our concltislons in 
this Investigation must be tempered by this fact. 

Of the 340 Head Start kindergarten children, a total of 112 
were selected to be experimental group* These were selected so that 
we obtained an eq^ial number of boys and girls, and so that the 

distribution elre of CMters In the total population was reflected 
in the sample* 

A control group was established by selecting a match for each 
child In the experimental group. This means that for each Head Start 
child I, a non«>Head Start child enrolled in the same kimdergarten 
class and of the same age end sex was selected as his control* Thus, 
a total of 112 control children was selected* 

Tounger grouo 

Of the 820 children In this group, IbO were selected to reflect 
the prmpottionate contribution of the slxe of their Centers to the 
total distribution of site of Centers. 

The parents of each of these children were telephoned to establish 
the status and «ligiblllty of the children. Those wl o were enrolled 
liTi a prc'^school, dttf care center, or other educational i ns titution, 
were excluded from the sample. Those whose parents could !^ot be found 
werife also excluded* This left 90 children who were eligible to be 
Included In the sample of younger children* In each case, parental 
petmlssion to have the child tested by the project staff was secured. 

In order to establish the comparability of the experimental sad 
control groups on sociv*economic measures; sn Interview yielding the 
following data was administered to both groups: $4% of the mothers 

and 71* of the fathers cd the experimental children were living at 



home the childreri whereas 100'^ of the mothers and 80^1 of the fathers 
of the control children weie living at home* 'Hie mean number of children 
in the experim ntal fajRilies is 3*1 and in t!ie control families is 2*1} • 

Ittie mean highest grade completed is 9th grade for all parents in both groups 
‘ttie mean total annual family income of the experimental group is ^l43-'0 and 
for the control group, this figure is *^100* Although both groups ai'e at 
poverty levels, it is apparent that neither represents the hard core of 
the poverty popxjlation and that the control group is slightly higher on the 
economic scale than the esq^erimental gi‘oup* However, income level on 
either the Binet or the Peabody test did net correlate greater than zero, 
so that such differences in the experimental and control population can 
be considered insignificant for the present purposes* We feel justified 

in assuming comparabili% between these groups* 

j 

The measuring instruments * 

1* Stanford-Binet, revised, I960 edition* This instrument was used 
because it provides a broad, reliable measure of cognitive functioning* It 
is a relatively high predictor of academic success and it is used as a base 
against which we msh to evaluate the instruments described beloir* Finally, 
the face sheet of this measure contains a thirteen item rating scale of the 
test- tarring behavior of the testee which has been found to be predictive of 

ji 

total IQ and performance in pre school for similar populations (Hess, 1966)# 

Test taking anxiety, a variable of considerable interest to the presefit stu<^f, 
is measured by sev^eral items on this face sheet and b^ at least one factor 
which emerges from a factor analysis of the items# 

2# Peabody Picture Vocabulary Test* This instrument has been used in 
several studies of Head Start children with similar results: PP?T test 

scores are t^'/pically lower than Binet IQs and do not correlate as highly with 
Bisiet scores when compared to data collected from normative (white, middle 
income) children (Berger, 1966| Burnett, 196i}-6$| Bisenberg, 1966| Harding, 1966 
Ozer, 1966 $ Rice, 1967)* This might be because the Binet has less of a verbal 
content at the jfounger age levels* It might also be, however, that (as argued 
by the present writer (Cline et al, 1966) the PPVT tends to inhibit the perfors<iance 
of 'hildren who have a distaste for being measured* A receptive language task 
such as uhe Peabody, which allows only a single right or wrong answer to each 
item, may prove too restricti%"e to children who feel uncomfortable in a testing 
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dltii^itloii. This would be pairticulairly true for those children uho ere, 
tft feet, irerbelly facile as well as anxious In the testing situation* 

Such an expectation is consistent with the literature on test taking 
anxiety (e*g*» Ssrason* et al, 1964, Zigler and Butterfield, 1966)* We 
have predicted, and previously found (Cline, et al, 1966), greater 
discrepancies between the Binet and the Peabody for children above their 
group Binet sedlan scores, indicating at least that the discrepancy 
between these aeasures is not limited to reduced verbal content of the 
Binet at these age levels* It should be expected that the Head Start 
experience would be reflected in the nature of the discrepancies between 
Binet and Peabody scores, if the Head Start experience does contribute 
to a child’s skill In handling himself in a testing situation* 

If the receptive form as well as the verbal content of the Peabody 
is an important source of variance in this kind of population, then 
holding the costent constant and transforming the test to an expressive 
form, should have a significant effect on the performance of the child 
with a distaste for being tested* Such an expressive form has been 
developed by the present writer, and put through several revisions* Its 
current form ut Hires plates 25 through 62 of the Peabody test, with 
procedures for administration and a scoring system that have been used 
several hundred times, (see appendix) It has not however, been used on a 
kindergarten population. To this point in the development of the instrument, \ 
dhildren generally score higher on the expressive (hereinafter referred to 
as the E form of the Peabody) than the receptive (hereinafter referred to 
as the R form of the Peabody) , and children with higher Binet scores 
acquire greater discrepancies of the E over the R* However, it Is not clear 
that such predictions would be made for an older kindergarten group* We 
simply predict that the E form Is less Inhibiting than the R and that this 
relationship should discriminate between groups receiving and not receiving 
a Head Start experience if such an experience is effective In increasing 

the confidence a child has in taking tests, 

3. Behavior Inventory. This Instrument Is a 50 item rating scale 
developed by Head Start to assess the social-emotional status of pre-school 
dbildren* It is used by the teacher to rate the child* Preliminary work 
with this lnstnm«nt Indicated that a factor structure emerged from its 
analysis which contained at least one factor which appeared to be conceptually 
related to the notion of test taking anxieties* Consequently, the measure 
was included in the present investigation In an attempt to discover such 
a usable factor In the analysis of our data. 
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4. Hetropolltan Reading Readiness Test. This Is the laeasure adnlnis*' 
tered by the school systcan at the end of the kindergarten year to assess 
the readiness of the child for first grade. We shall use these scores 



as the aeasure of acadeinlc success. 

The evaluatlonal hypotheses follow from the expectation that the Head 
Start experience yltlds improved performance on the Blnet and the several 
forms of the Peabody (although this is based on the unproved .'assumption 
that control and experimental children were at comparable levels on these 
instruments prior to Head Start); that the Head Start experience is associated 
with reduced discrepancy scores, greater gain scores over the kindergarten 
period on the Peabody R and E forms, better academic achievement, and better 
social-motional adjustment as revealed by the Behavior Inventory. 



The evaluation design. 



The original plan called for the start of the testing program during the 
first week of Head Start. Since the project was not activated until September, 
this was impossible. The smople was selected by late Septead>er, so that 
testing could not begin until then. The revised design called for pre testing 
as early in the school year as possible, post testing after a minimum f four 
months, and the administration of the Behavior Inventory at the same time as 



the post testing. The reading readiness test is administered by the school 
system to all kindergarteners early in May so that this date became the end 
point of the testing program. 

The pretest battery included the full scale Stanford-Binet, Peabody-R 
and Pcabody-E. Binet testers were also instructed to fill out the face sheet 
for each child. Blnet testers were hired from the ranks of those who had 
completed a graduate level course in intelligence testing, who had experience 
under supervision in administering the Stanf ord-Binet , and who administered 
the Binet to at least three low income Negro preschoolers under the supervision 
of the staff director of testing. All Blnet testers were white, nine were 
women, two were men. All followed the same set of directions and procedures. 

The 18 PeiH»ody testers %iere undergraduate Negro women (except for one 
White woman and one Negro male) none of whom had administered any sort of 
test to children before. All were trained by the project staff director. 

Half the testers were trained to administer the £ and half the R form* In 
order to assure the reduction of tester bias, the following procedures were 
adopted: All children were randomly assigned to testers; no teste? could 

test the same child twice with the same instrument or with different instruments; 








tko tester could tc^eul the score of any child to aay other tester (scores 
vere reported ouly ii* code) and no tester knew idilch children were Heed 
Start and which were controls* Seee Binet testers did adeinister the 
Peabody to soae children* but never to the saee child to who* he had 
adaiinistered the Binet* Peabody-R tests were scored by the tester* but 
the Peabody-E tests were scored by two independent judges >»ho adjudicated 
any differences in the scoring of a child’s verbal response in conference* 

No differences were allowed to reioain in the scoring of the E# 

In order to eliminate order effects in the adnlnistratloo of these 
instrunents* the following procedure was adopted* the Peabody tests were 
always adaiinistered before or after the Binet* Half the sid>jects received 
the E first and half received the R first* This procedure nade tester 
assignments extremely awkward so it was decided to analyze for order effects 
after the pretest to see if such a balancing was necessary* As reported 
below* there were no differences between any of the tests associated with 
their position in the order of administration* Consequently* the balanced 
order of the Peabody was abandoned for the post testing* although the two 
Peabody tests were always given* before or after the Binet* 

{iSMfediately after the pretesting of the older group in si^ool was 
finished, the younger children (l*e*, those who had attended lead Start 
in the stssMtr but who were not old enough to attend kindergartam) were 
contacted in their homes* Arrangements were made to test In the home and 
this was accomplished In the late fall* Only one administration of the 
battery composed of the Binet and the two forms of the Peabody was carried 
out for this population. These Children were to be used for a follow-up 
study once they returned to Head Start the next stsmmsr* 

Table 1 suMMirizes the number of tests given to the several populations 

at the several times during this study. 

T^le II simmarises the number of boys snd gitls in the seversl age and 

experimental groups* 



Basults* 

Attrition* Although oversampling waa carried out in order to compensate 
for the expected lOX lose of subjects from pre to post, a considerably large 
rate of attrition occurred* Each of the dependent meaaures were given on a 



different day> iililch produced a different rate of attrition for eadi* These 
rates u^aried between 20—30%* In order to determine if this loss represented a 
selective withd awal of low or high performing subjects which might serve 
to bias the post test sample, a cemparlson of the pre test scores for those 
subjects who received a post test and those subjects who did not receive a 
post test was carried out for each of the three dependent measures* Table III 
summarizes these results, which clearly indicate that the sample was not 
biased by the loss of subjects. In no case is there any differences between 
the pre test scores of the subjects who did and who did not take post tests* 

Tester effects * The total number of tests administered by each tester 
was tallied and a simple one way analysis of variance run in order to estimate 
tester variance* No significant effects were found even though two of the 
testers produced a mean IQ on the Blnet of 77, and two testers produced means 
of 84 and 86 respectively* Peabody testers were slightly more uniform with 
means ranging from 82—94* Clearly there is some tester variance (see appendix, 
Tablesli-*iy> but the within variance Is much too great in this population to give 
this factor much credence* It would not be wise, however, to generalize these 
results to other populations or testers* 

Order Effects * Mean scores for pre test PPVT-R and PPVT-E In each 
ordinal position were calculated* No differences were noted indicating that 
it makes no difference whether the E or R form of this instrument is administered 
first* The Blnet was administered to some children before and some after 
the Peabody tests but never between the two* No differences In Blnet scores 
related to ordinal effect was noted* All instruments within the battery 
were administered to each child on separate days but within a week of each 
other* 

Binet results * Table IV suMiarizes the pre and post test Binet IQs 
for the several populations* The median IQ for the total population is 
89*5 and in subsequent analysis, all S*s are sorted In High Binet (90 and 
higher) and Low Binet (89 and lower) groups* This split is appropriate 
because we wish to know if the discrepancy between the Peabody scores and 
Binet scores is the same for high and low Binet scorers* 
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In otd«r to rnttiftm thoio do«:o furthot, m mmlymim of w^timcm wltli 
utioquol n't woo eatriod out on the pro teet Blnet eeoree egelnet Sex U)» 
ExpetliMtttei Stettta» Bead Start/ Control (B), and High (aboee Mdlan)/ 

Low (below nadian) Blnet IQ (C)« Thla analyala la ataataarlaad In table V* 

Clearly the only algnlf leant effect is (C) whldi la alnply a eonflmatlon 
of the high/low Blnet apllt« Most ii^>ortant la the non-algnlfleance of (B), 
Experimental atatna. At the beginning of kindergarten there were no dlfferaneea 
between Head Start and control group children on Blnet IQ, and no dlffereneea 
between boya and glrla. If any differences existed as the reanlt of Heed 
Starts they havre disappeared by the first month of kindergarten. On the 
other haed» the experimental and control children were not maaanred prior 
to Bead Start and It la conceivable that the Head Start childran were below 
the controls at that time so that the result of the summer program was to 
raise the Head Start children to the level of the controls, fhii point 
cannot be evaluated* although it seems unlikely# However* If the summer 
did produce meanlnfful gains for the Bead Start Children* this should be 
refleeted la some pre test advantage over the cotttrc»l children. If this 
Is not true* It ml^t be expected that the momentum generated over the 
summer* which brought the Head Start child to the level of the control 
child (if In fact they were not equal before Bead Start) should carry ths 
experimental Child forward in kindergarten. The analysis of the rest of 
our data should speak to this issue. 

An analysis of variance of Blnet gain scores against Sex* Experi- 
mental Status and Bl^/iow Blnet pre la summarised in table fl. the only 
significant effect Is (C) High/Low Blnet indicating that the children with 
high pre test Blnet scores show significantly different gain scores than 
the Children with low pre Blnet scores. However* further examination of 
these restilts suggests that this might be e spurious factor, the mama 
gain (mean of the distribution of gains) for the high Blnet group is minms 
2 IQ points* sad the mean gain for the low Blnet group la plus S IQ points* 

A loss em post test scores for a high group and a gain on post test scores 
for the low group Is precisely what is expected If the true mean gain is 
aero, this Is the regression phenomsnon In which the prediction of post 
from pre Includes an asymotrlcal distribution of post scores at the upper imd 
lover extremee. It would be dengeroue Indeed to refer to there reeulte ee ea 
Ineteace of true diange ovar tlma. 

Of greatar leportaaca la the raault that both axperlmantal and control 
group# were Identlcel In their failure to ehow geine over the klndergerteu year 
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(1*9 and .7 IQ iaproi/^emeiat respectively)* Sex differences in gain also 
failed to appear* , « 

PPVT-E Table iv suamaari^es the pre ai\d post PPVT^R IQs for the 
several poptilations * TableVl| summarizes the analysis of variance of the 
pre scores and indicates that Head Start children wer^ not different from 
control children at the beginning of the kindergarten year on this Instrument* 

An analysis of pre-post (repeated measures) scores J iidicate that a significant 
improvement in PPVT-E scores ia apparent (F • 23*08, df 1/164, p < *001)* 

Table VUlSfimsiar ires the analysis of var.t^ce of th^se gain scores and 
indicates that there are no differences in the several groups in amount of 
gain. That Is, boys and girls, Head Start children and control children, 
high Binet children and low Binet children, all gained about 6*7 points on 
the PPVT-R over the kindergarten year* In order to examine this finding 
for regression effects, the total population was further split at the 
median pre PPVT-R score (85 IQ) , and the high and low groups examined 
for their respective gains* The high scorers show no difference between 
pre and post (*.3 IQ points) whereas the low scorers show a gain of 11*4 
IQ points* It is not clear whether this represents a regression effect or 
a differential impact of the kindergarten year on the low compared to the high 
PPVT-R children* The total distribution is displaced so far downward that the 
lov^er end contains a disproportionate niraber of extreme scores, whereas the 
high group contains very few extreme scores when compared to the normative 
group* Under these conditions, one would expect greater regression to the 
mean for the low than the high group, but this is not established one way 
or the other in these data* Correlates of PPVT— R gain scoreif are reported 
below* 

PPVT-E Table IV sutonarlzes the pre and post scores for the several 
populations* Table XX summarizes the analysis of variance of the pre scores 
against sex, experimental statuf , and high/low Binet* Significant main effects 
occur for sex (boys are higher) and high/ low Binet (high Binet scores are 
associated with high E scores), but there are no effects attrlbut**ble to the 
Head Start experience. A repeated measures analysis revealed no significant 
change In the E from pre to post, but an analysis of the gain rcores (Table X ) 
indicates that high and low Binet scorers show differential gains on this 
fora of the PPVT, Thus high Binet children show a mean gain of 2 points 
on the E and low Binet children show a mean loss of 2 points. Of greater 
significance for the present purposes is that there are no differences 
associated with the Head Start experAence* 

In order to examine for regression effects In these data, the E distribu- 
tion was split at the median and the gain scores for these above and below 

separately ccmputed. High children show a mean loss of 5*07 points and 
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lot» children sho’..' a mear. ^ain of It.EC foints. As >iith the Einet results, 
these data look very much like the artifacts of reRrossion rather than 

time changes in e^^reme populations* 

»5*ablo XI suMJiarises the intercorrelations between the Binet, and 

E measures* 

Ifetropolitar Readin.^ Headltiess Test , This instnmient was adni.ristercd 
to all children in each of the sairole classes and total class scores were 
made available to the present investigation. These data were handled in 
two ways* First, the mean total mi scores for the c::p-erimental and 
control childi-en 'fere tallied and analyzed* This allows a comparison of 
these groups to each other and to the national norms established for the 
instrmJient* However, we also^Hlsh to know the position these groups have in 
their class mth respect to reading readiness* lhat is, independent of 
absolute level of achievement, are Head Start and control children typdcally 
at the top, middle or bottom of their classes* Gonsequentls/*, the total 
distribution of each class was established and each of the sample cnildren 
was assigned a score to reflect his rank in a three position (top third, 
middle third, bottom third) distribution of scores. Tables XII and XIII 
summarized the analysis of variance of the MEET total scores and the liixET 
class ranks respectively* In both instances a significant main ef lect is 
found for sex (girls score higher than boys and aie tj’pically at the higher 
rcnic positions in their classes than boys) and Binet IQ (in a system in 
which rank 1 is the bottom third, rank 2 is the middle third, and ra^ 3 
is the top third, the mean rank for boys is 1*86, for girls 2*26, F - 15.8, 
df l/l8l, p<r*0l5 mean rank for high Binet IQs is 2*lil and for low binets 
IQs 1*72, F s h5.23, df 1/181, p<*00l5 mean rank for Head Start children 
2*03, for controls, 2.10 (ns) ). Clearly, if Head Start generated momentum 
in these children, it was dispelled by the end of the kindergarten year* 
Behavior Inventory* This 50 item rating scale was filled out by 18 
teachers describing 600 children including the present sample i>opulation at 
the end of the school year. These data were factor analysed (principal 
components, varimax rotation) and a three factor structure emerged wh&h is 
both interesting and usable (see appendix). Factor 1 has been labeled 
¥ithiratd.ng. Inhibited, Caster liilquetoast. Factor II has been labeled 
. Bpulsive, Low controlled, Donald Duck* Bie third factor has been labeled 

Hon^persistence in Broblem Solving* 

Each sample child was rescored by summing his scores on the items 
loading on each factor, and t-’.ree factor scores uore then recorded for each 
chUd. The analysis of variance of these scores are given in Tables XIV, XV, 



Factor I significantly differentiates high and low Blnet children* 
but no other main effects or interactions are noted# This means that lew 
Blnet children tend to be judged by their teachers as withdrawn and 
inhibited* and high Blnet children are not so judged by their teachers# 

However* at the end of the school year there does not seem to be any 
differences between Head Start children and control in their teachers 
jpareeptions of them on this dimension. 

Factor II reveals no significant main effects but It does show 
an interaction between sex and high/ low Blnet# Teachers see low Blnet 
boys and high Blnet girls as more hostile and Impulsive than high 
Blnet boys and low Blnet girls# Most of this effect seems to be the 
result of the very high impulsive scores given the high Blnet girls 
by their teachers* 

Factor III shows a significant sex effect, high/low Bluet effect 
and an interaction between the two# Thus, girls are more persistent than 
boys in problem solving* high Blnet children are very much more persis- 
tent than low Bine t children* hlgi Binet boys are more persistent than low 
Binet boys, but there is no difference in the persistence of hj^ Binet glr‘ 

Binet Face Sheet The thirteen items constituting this scale were 
factored and four interesting and usable factors emerged# (see appendix) 

These factors have bean named; I Emotional dependence; II Feralstence; 

111 Social skills and social confidence; and IV Dls tract ibility# 

Analysis of the pre test factor scores are summarised in Tables XVII-XX. 

Factor I shows just one effect; High Binet children arc less 
dependent than low Binet children. Again the same pattern with respect 
to the Hiiad Start experience emerges# Binet testers, who did not know 
the experimental status of the children* were ’ynsble to distinguish 
between Head Start and non-Head Start children in their dependent 
behavior while taking the pro test* although the testers could distin- 
guish between the high and low IQ children on this dimension# 

Factor II the persistence measure* distinguished between high and 
low IQ children, and none other* Factor III functions in the same manner* 

Factor IV, the dlstractibility measure has a significant main effect In 
the high/low Binet score as well, but there is also a significant inter- 
action between sex and experimental status on this dimension# Experimentsl 
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boys sre more dls tract lb le than control boys; control girls are more 
dlstractlble than experimental girls. 

Discrepancy scores . There ire two discrepancy scores of Interest: 
Blnet PPVT~R differences and PPVT-R-PPVT-E differences. In both Instances 
we expect the PPVT-R to show lower scores than the other two, although 
this effect Is not expected to be the same for all sub-groups In our 
populations. We are particularly Interested In the nature of these 
discrepancies at the end of the Head Start experience for Head Start and 
control children, for boys and girls, and for children above and below the 
median Blnet IQ* 

Table XXI summarizes the analysis of variance of pre Blnet-R dis- 
crepancies. All discrepancies are In favor of the Blnet over the Peabody. 
Girls show a greater discrepancy than boys, and high olnet children show a 
greater discrepancy than low Blnet children. Thus, girls average 7.5 
points higher on the Blnet chan the Peabody, whereas boys average 2.4 points 
higher on the Blnet than the Peabody, High Blnet children average 9.5 
points higher on the Blnet than the Peabody, whereas low Blnet children 
show no difference In their scores. If thes'^^ discrspancies tell us some- 
thing about the differences between boys and girls, and between high and 
low Blnet children, they do not tell us about any differences thst might 
exist between children who went to Head Start and those who did not, since 
there are no significant effects for experimental status. 

Table XXII summarizes the analysis of variance of the post Blnet-R 
discrepancies. After a year In kindergarten girls still do better on the 
Blnet than the Peabody to a greater extent than the boys. However, the 
distinction between the high and low Blnet children on this discrepancy 
me^i^re has disappeared on postest. This results from the fact that the 
high Blnet children (who scored significantly better on the Bluet than the 
Peabody on pre test) , lost a few points on the Blnet and gained a few on the 
Peabody over the year. The low Blnet children (who shewed no difference 
between their Blnet and Peabody scores on pretest) gained a few points on 
both tests over the year. The difference between their difference scores 
was thereby reduced. However, this change over time Is hard to Interpret 
since so much of the pre-post gains scores for these Instruments might be 
artlfactual. Of greater significance to the present work Is that Head 
Start children are indistinguishable from non-Head Start children on pre 
or post testing. This Is particularly important since the pretest dis- 
tinction between high and low Blnet discrepancy scores has constructive 
validity. 
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In o'ikder to examine the nature of the Blnet-R discrepancy measure, these 
scores T^ere compared to the other data In the present study which hear relevance 
to test-taking attitudes* Thus, It Is expected that If the Blnet-R discrepancy 
is the result of an inhibitory effect generated by the form of the Peabody test, 
then this discrepancy measure would correlate with Factor I and II of the 
Behavior Inventory Cwlthdrawn and impulsive, respectively), and with Factor IV 
of the Binet Face Sheet (dlstractiblllty) # However, no significant correlations 
were found among these variables, indicating that the discrepancy score may be 
reflecting dimensions of the child not recorded by the teachers or the testers, 
while at the same time it does differentiate ^between high and low Binet 
children to the same extent that the teachers and testers make this differenti- 
ation* 

Tables JOlIII and XllI/ siimmarize the analysis of variance of the pro FPVT E and R 

discrepancies* Here, a diffr^rent picture emerges than the findings for the 
Blnet-R discrepancy* The only significant effect is across high/low Binet 
scores, but in this case, the low Binet children do better on the E than the 
R, whereas the high Binet children do better on the R than the £* There are 
no effects attributable to the experimental status of the children, and there 
are no sex differences in these data* On post test, this effect disappears* 

This is due primarily to the change in discrepancy score of the low Binet 
children* At the end of the year, these children have improved on the R to 
the extent that the differences between their scores on the two tests, and 
the difference between the differences for the high and low Binet children 
have disappeared* This change in the magnitude of the discrepancy score due 
to a change in the scores of the low Binet children is the reverse of the 
effects noted for the Binet-R discrepancy* There the changes were related 
primarily to variations in scores of the high Binet children. In this case 
the changes are due to variations in the scores of the low Binet children* 

In order to examine the content of the E-R discrepancy, correlates in the 
factor structure.^ of the Behavior Inventory and the Binet Face Sheet were sought* 
However, as in the case of the Blnet-R discrepancy, no significant correlations 
were found* 



f<icli»y md tdteol affeeti fhe ta find meiif affaett attirlbutabia 

to anpatinintai atattia niQr ba tha raatilt of a nadblng affact dna to dlffatantlal 
parfomanoa of Uaad Statt and control dilldtan in dlffatant aaboola or in dif* 
fatant olaaaaa« It nay 'iba that taaohata or adioola diffar ayatanatieally in 
thair traatnant oE Head Start diildran cotaparad to thair traatnant of tha control 
childran* Coniaqnatttly» analyaaa acroaa taachara and acbool wara nan naing 
reading readinaaa acoraa and Behavior Xnvantory factor acoraa (tha naaanraa uaad 
by taachara or noat atrongly influancad by taachar behavior wiidi raapaet to tha 
diildran) aa tha dapandant aaaanraa* Both taachara and achoola ahoaad aignif leant 
aain ef facta against reading raadlnasa acoraa (tatdiara: F* $*9bt 18/156 » p*< 

001 sdioolss^ F« 8.84, df, 13/161, p-c.OOl), indicating that it nattarad greatly 
to a diild*a acadattic achiavanant lavel vhidi adiool ha attended and vhich teacher 
ha had. In order to detamina if the Head Start children ware differentially 
affected by the school or teacher, interactions between aicperinantal statns and 
school or readier were araainad. Ho significant interactions ware noted for 
these variables. Clearly, sone teadiars are associated with greatar raading 
readiness scores then others (the range of class percentile ie 20*82), but 
there Is no evidence that the kindergarten teediere were producing different 
levels of acadeade achiavenent in Head Start graduates than in non*Head Start 
children. {Tables IX?IX - xm) 

Althouih it is not part of goal of tha present inveetlgetion. It ie of 
Interest to go one step further in identifying the sources of variability of 
chtldren in reading readineaa at the end of the kindergarten year. Va wish to 
detemine if this vi^riabllity la the contribution of tba teaduit or the diildran. 
Thus, if there Is sows selective assignnent working so that diildran with greeter 
reading potential are aaeigned in sone wanner to cone teediere, then the variance 
acroes teediere at the end of the year ie attributable to the children end not to 
the teacher. (It is hard to imagine how such an aaaignnent ayatam could work at 
the beginning of kindergarten, although it would start to work tha first tint 
the children were re*aaaigned to teadiers. However, audi a differanca aeroaa taa* 
chera at the beginning of the year in theta adioola could occur by dianea.) Tha 
only aeaaure taken at the beginning of the kindergarten year Whidi mi^t be 
relevwit to this iseue ie the Binet. Consequently, e einple one w^ anelyeie of 
variance of teaehera egeinet pre Binet ecoree was run. Tha fw 2*48, df 18/196, 
p«< *01. This indicataa that there is significant varianca aeroaa teadiare on 



Blttet* although closeir examittatlon (Newman'-Koules test of multiple 
Indicates that almost all of this variance is contributed by a single teacher 
whose experimental and contrcl children at the beginning of the year had a 
mean IQ of 74. 3 • The class percentile In reading readiness for this teacher 
at the end of the year is 42.0 hich is well within the normal limits for this 
population. However, It is clear that the mean pre Blnet IQ 1® 18 of the xy 
classrooms are equivalent to each other and that at the end of ^ 

is very great differences across these classrooms. In order to substantiate 
this point, an analysis of covariance was run with the pre Blnet . 

the covarlate. With the pre Blnet scores covaried the S equals 2.68 df 18/iyt) 
>p is less^.ui, indicating that there are significant differences across 
teachers on reading readiness. Since the substance of differences between 
teachers cannot be examined further with the data of the present Investiga-* 
tlon, we shall report no further results of this analysis. Clearly, teachers, 
as well as p«pH characteristics, contribute meaningfully to leading readiness 
acliievement at the end of the kindergarten year. 



Teachers rated the children in their classes on the Behavior Inventory 
within a week of the administration cf the reading readiness tests. In 
order to determine if there are differences across teachers In the assign- 
ment of such ratings, and if there are Interactions between teachers and 
experimental status, an analysis of variance of each factor score was run. 
labels XXVII-XIX sunouarize these analyses. In each Instance there is a 
significant main effect for teachers* Factors I and III also show a 
significant effect for sex (boys are judged as more withdrawn than girls, 
and girls are judged more persistent than boys). Of interest, however, 
are interactions between experimental status and teachers on factors 1 and 
III. Visual examination of these analyses reveals that there Is no 
systematic direction to the interaction, which are weak at the most* In 
both instances the largest source of the effect comes from a single teacher, 
who judged Head Start children as more withdrawn and lacking in persistence 
than control children. This teacher appears to be different from all 
other teachers in these judgements. It would be difficult to attribute 
any further significance to these interactions. Although It is not possible 
to ascribe these post characteristics to teachers, since there are no pre 
Behavior Inventory ratings with which to make comparisons, it is clear that 
at the end of the year, children differed on these characteristics depending 
upon the teacher with whom he has been associated. This effect might, of 
course, reflect a teacher responsible bias rather than real differences 
between children, but this analysis is beyond the purposes of the present 
study. 
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Inteyrelatlotts between liiitriiaents * We are interested In certain 
interrelations betureen the measures for a number of reasons* First* 
although several of the measures differentiate between children* albeit 
not Head Start and control children* it Is necessary to know if these 
measures are related to criterion measures of theoretical Importance. 

Thus* It Is Important to know if the measures we have taken predict 
reading readiness with efficiency. If so* the Interpretation of the 
results reported above Is clarified, 

NeKt* it Is clear that the only measure which shows significant 
improvement over the year is the PPVT-R, In order to interpret this, 
the correlates of such a gain would he helpful. 

Finally* the discrepancy between the Blnet and the PPVT-*R scores 
are very variable, and of theoretical significance. This discrepancy 
score Is not related to several variables which serve as construct validity 
criteria. Consequently* a further investigation of Its correlates Is 
necessary to help interpret these data. 

Each of these Interrelations was carried out by way of a stepwise 
multiple regr jlon analysis summarized In Tables XXX^ ItXIfT and XXXIT* 

In predicting scores on the MRRT, it is clear that the set of predictor 
variables (the Blnet IQs* Peabody IQs* Blnet Face Sheet factor scores* 
and Behavior Inventory factor scores) account for a very large portion 
(66*51) of the variance of the criterion variable. The variables with 
the largest beta weights (1.02 - 2.53) In this set of predictors are 
the Peabody scores* the Blnet IQ* and Behavior Inventory factor 111 
(Persistence) . Our purpose here Is not to make predictions of the 
criterion variable* but to demonstrate that the measures in our battery 
are meaningful with respect to reading readiness* It is clear that this 
Is true, particularly for the Persistence score and the Blnet IQ. 

The PPVT*R gain score Is of Interest because it Is the only gain 
to reach significance. Hare* examination of the iteration sequence 
shows that the two discrepancy measures constitute the best set of 
predictors accounting for 3IX of the variance probably because both 
contain the PPVT-R as a component. As reported above* low R scores (and 
consequently high discrepancy scores) are associated with high R gain 
scores* The full battery Including those first two variables account for 
41% of the variance of R gain scores indicating that there are few 
correlates of any meaning for this variable. This Is conslsteiit with 
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the aotioii that 1I?FVX*E geiu seoires ete leifgel^r the eaepreeeioii of e tegreeiioe 
phenoMiiiosi tethet thee aeaElngful gelne* 

EsMidiietioo of the Itexetioo eequeace for the Biiiet**E dieorepeo^ ehoirs 
that the firet three variables constitute the best set of predictors » 
accomtlug for 471 out of the total of 571 of the variance accounted for hy 
the full battery. Once agaln» the variebles with construct validity for 
this variable (the Behavior Inventory and Bluet Face Sheet) do not contribute 
neanlngfully to Its variance. The £**E discrepant » idilch Is the first 
variable In the Iteration, and the PPVT->E, which Is the second, both should 
reflect the sane phenonenon as the Blnet'-E discrepancy, vis., expressive 
tasks such as the PPVT*£ and Blnet, represent lesser denands for the 
populations used in the present investigation. Operating In consort, these 
variables seen to relate to each other although external validity Is still 
ladklng. 

Piscusslon . 

The nost inportant finding around which this discussion nust center 
is the failure to discover any effects associated idLth the Head Start 
experience. This includes any effects Innediately after the Head Start 
suaner, and those longer tern effects of alnost a year later. The 
expectation that a suaier of Head Start nay serve to Inoculate the child 
against the sources of acadenlc failure and, therefore, show up sonatina 
after the injection to protect when the disease attacks, does not appear 
to be fulfilled In these data. The first Issue raised by these findings 
is whether, in fact, the battery of neasurenants Is sensitive to the kinds of 
changes which night have been produced by Head Start. This Is an unanswerable 
question, of course, but an approxlnatlon of an answer can be made. Clearly, 
many naasures of soclal*enotlonal skills were not Included, and nany cognitive 
dlJieasions were not tapped. Motivational dianges ware not conplately 
sanpled. Further, nany of the school-related naasures that were taken In this 
study are not necessarily Included as part of the stated goals of Bead Start. 
Hevertheless, the effects of an educational progran nust ultlnately be felt 
in acadenlc work. It night well be that this ultlnate effect Is produced by 
changing the Child's sense of self worth, notlvatlon to learn, sense of 
nastery over the envlronnent, anxiety levels, and a feeling of ccnfortableness 
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in the educational setting. But ultimately* these changes should show up in 
academic achievement* if some external event (such as the behavior of the 
teacher is post-Head Start experiences) does not Intervene to destroy the 
effects* We feel justified therefoie in using reading readiness scores at 
the end of the kindergarten year as the criterion for the battery of 
instruments In this kind of evaluation study. The finding that a multiple 
regression analysis can be generated which correlates .85 with the criterion 
variable* is very supportive evidence that the instruments used In the present 
study are quite relevant to educational output. Further, the finding that 
both the cognitive and personality measures differentiated between groups of 
subjects who are knoimto be different (boys and girls; high and low performers 
on the Binet) , suggests that the instruments are sensitive to Important 
differences between subjects. This underscores the failure to find any 
differences associated with Head Start experiences. 

Our battery cf instruments Included two measures of the childrens* 
behavior: Behavior Inventory, and the Binet Face Sheet. In both cases a 

clear syndrome of behaviors is associated with the Binet IQs and with reading 
reidiness. Children who are persistent in solving problems^ relatively 
independent* not overly timid and withdrawing, and socially confident* are 
cleat sy tiie successful children in school and on the Bluet test {zero order 
correlations between all of these variables are significant as well as their 
analyses of variance and multiple regression analysis). If these variables 
can differentiate between those who are high and low on a measure such as the 
Binet* and can predict relative success in school, it could be expected that 
an educational urogram such as Head Start could be evaluated by them as well* 
it is true that the Behavior Inventory was administered by teachers, and this 
might aerve to obscure or distort difference between children on these measures. 
That is, teachers might be rating children dlffereiitially according to their 
status as Head Start graduates. In fact, there was no interaction between 
teachers and experimental status. Indicating that teachers were telling the 
difference between children but not along the Head Start-non-Head Start 
dimension. Further, the Binet testers who rated the children in the Binet 
Face Sheet with respect to their test taking behavior, did not know which 
children were experimental and which were control children. Ho differences 
between these groups occurred on the Face Sheet measures, although other 
differences did 



occur* 









The Peabody Picture Vocabulary Test (R) should be examined further » 

This Instrument has shown two characteristics when administered to deviant 
populations s it generates IQ scores which are typically below those of the 
Blnet and it produces rather large gain scores* These factors* together 
with Its ease of administration probably accounts for Its popularity In 
evaluation studies of programs such as Head Start* There are* however, 
several reasons why this may be an unjustified popularity* 

An instrument that systematically depresses scores (relative to the 
Blnet) may be perfectly usable if one knows the extent of the systematic 
depression in order to make the appropriate adjustments. However, in this 
case, it is apparent that the di'^crepancy between the Blnet and the Peabody 
is very much a function of the Binet score* The discrepancy is significantly 
higher for high Blnet scores than for low Blnet scores* This finding 
replicates the finding of the present writer on a younger population (Cline, 
et al, 1966) and indicates that the discrepancy is not simply a matter of 
the scoring system of the Peabody or the reduced verbal content of the 
Binet at early age levels. In the previously cited work, the present writer 
suggested that this discrepancy reflects the inability of test-anxious 
subjects to cope with the kind of receptive task provided by the Peabody. 

There is a great deal of variability in this discrepancy, but in the present 
study. Its only correlates are the Binet scores and other discrepancy 
measures Involving the Peabody* It does not relate to any other construct 
validity criteria in the present study* The low correlatlons^e tween the 
Peabody (R) and the Blnet (.44 In the present study, compared^. 75 as 
reported in the manual for the Pecbudy) results in part from the restricted 
age range of the present population compared to the age range in the standard* 
Ization population of the Peabody, and in a slightly restricted range of 
Binet IQs. But there is the clear implication that these two jnstrumeiits 
are not measuring the same functions despite the claims of the Peabody test 
constructors. It is not clear what the Peabody is measuring, although it is 
clear that it underestimates the higher Binet scores more than the lower 

Binet scorers* 

The large gain scores which seem to be characteristic of this inrtrument 
wer« found in the present population as well* One might be tempted to say that 
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the cotuie<|iiefice of e year In klodergaften ptodtioed an l 1 ^>l^ 01 rettattt In Feahody 
acorea if It vere not for another finding of this study* The gain aeorea on 
the FeabodF restricted ali&ost entirely to those diildrsn Who scored helow 
the median score of the Peabody on pretest* Those aboue the aedian show no 
gain. An alternative explanation <to the hypothesis that this is a true 
experinental finding) » is that such a pattern should be found with randon 
changes from pre to post on an instruiiient which produces disproportionately 
low scores. This is one half of the regression phenoaenon in whitb there is 
greater regression to the mean found in the more extresMi scores than in the 
less extreme scores* The large gains shown on the Peabody» appear to be 
due to the random fluctuations of the very large numbers of scores at the 
very low end of the Peabody distribution. This explanation needs to be 
examined in much greater detail than our present purposes allow* It is 
reasonable to assert, however, that until the regression hypothesis is re- 
jected, Peabody data collected from low income preschoolers must be considered 
suspect* 

In order to examine the suggestion cited above, that the Peabody (R) 
underestimates IQ scores because of its uniquely receptive nature, an ex- 
pressive form of the Peabody was developed by the present writer. It was expected _ 
that this form (E form of the Peabody) would produce higher scores than the 
receptive form. This did occur, but only for those subjects who are below the 
median Binet score* Thus, for low Blnet children, the £ form is easier than the 
R, and for high Binet children, the Binet is easier than the 1* On post test, 
both of these discrepancies disappear, primarily because of the large (and 
perhaps spurious) improvement in R scores. (Binet-R and E-R correlations with 
PPVT-R gain score are c50 and *47 respectively indicating that large discrepan- 
cies are caused by low is which produce high R gain scores)* 

Although little construct validity is available for these discrepancy 
measures, it is clear that they differentiate between Binet scores* They do 
not differentiate between Bead Start and non-Head Start childran, but sines 
it has not yet been possible to demonstrate that they are related to teat 
anxieties, it is difficult to interpret this finding* 




A good doal of the ooBpmtieoii betireeti Heed Stett end oonttol ohildtea 
is based on the essuaiptioii of theii: equlireleoce prior to Head Start » the 
only data relevant to this point are the pretest measures taken a aonth or 
two after Head Start ended. It would be reasonable to assuae that if there 
are any Head Start effects in these data» it would show up at that tias* 

There are, of course, two reasons why significant gains attributable to 
Head Start would not show up on testing in October* The first is that six 
weeks to a aontb of kindergarten would wash out any such effects* The second 
is that the true gain of the Head Start children was to catch up to the control 
children who were acre advanced than the experiaental children at the tiae of 
entrance to Head Start* This latter point is less convincing because if the 
Head Start children did, in fact, start out at a lower level and slaply caught 
up in the suaaer, then soae kind of reflection of that would be found in one of the 
tests, or in one of the gain scores* A child Who has been able to aake large 
gains in a suaaer prograa should be able to exhibit soae aoaentua for a few aonths 
at least. The only alternative to thia is that the gain over the siamer was so 
tenuous as to be easily dispersed by the beginning of school. Such a gain would 
be hardly worth striving for. 

The hypothesis that a gain aight have bean preaant ,, but washed out by the 
kindergarten teacher dnrlng the first few aonths of school, is a aore reaaonCble 
one. If thia la the case, however, it aesns thst the Head Start child auf fared 
aore at the hands of the teacher than the non*Haad Start Child* It was to teat 
this hypothesis that the several interactions batwaen teachers and experiaental 
status were examined* None were found to be close to algnif leant. It is clear 
that the kindergarten teacher does exert a graat deal of iofluanca over the 
performance of the child, and if that influence was nagatlva, it aight be great 
enough to obscure differences between experiaental and control children during 
the first few months. This is a clear possibility that cannot he examined in 
in the present study. Teacher behavior is an axtramaly important and, as yat, 
unexsmlnad variable in this investigation. Tha answer to these questions can 
only come from careful ohaervation of teachar behavior from the very first di^ 
of school, together with measures of the Children tiken after Head Start hut 
before kindergarten* 



Conclusions 



We have attenipted to sample from the populations of teachers 
and children of the kindergarten classes of the Negro ghetto of • 
Washington^ B«,C* Half of the children in the sample attended summer 
Head Start preceding kindergarten; half did not* Although not strictly 
equivalent in the sense of the random sampling models the two groups 
of children were drawn from the same socioeconomic stratum. The 
qualifications concerning equivalence ^ discussed above ^ restrict 
the generaliaability of our conclusions to a population which is^ 
perhaps^ somewhat less transient than the Head Start population for 
the whole city. However^ In the most limited sense ^ the findings 
do represent the probable effect of Head Start on the academic 
readiness and social-emotional development of economically impoverished 
children of the District of Columbia. 

The sampling of programs was representative of chose offered 
lower Income children by the D.C. Board of Education. Even without 
an in depth analysis of either Head Start or kindergarten programs^ 
it is possible to conclude that no discernible effect of Head Start 
appeared.^ Whether this failure lay with Head Start, or whether some 
kind of stereotyped teacher responses in the kindergartens obscured 
Head Start effects. Is still not clear. > 

Another conclusion to be drawn is that the variability observed 
across our several variables was much greater than would be expected 
by chance. To be culturally disadvantaged clearly has no uniform 
meaning. The children of the lower income families are much more 
heterogeneous than is often supposed. It is entirely possible, 
therefore, that the failure of teachers described above, depends on 
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their lack of awareness of the range and variability of the skills 
present in their classes ^ and the nature of the deficiencies likely 
to be present In individual children. 



A final concluBion^ and one that must serve as a vmrnlng to 
researchers in this fields is that scores on reliable, much used 
instruments such as the Peabody have both more variability and less 
validity among children of the urban poor than upon the children 



on whom 



y were 







1 
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TABLE 1 



tiufiber of children receiving each of the 
neaaures* 



Pre»Testa 





Blnet 


PPVT-R 


PPVT-E 


Experiaent 


112 


111 


113 


Control 


103 


100 


105 


Total 


215 


211 


218 


Invalid 


0 


0 


0 


Total (usable 


215 


211 


218 



teats) 







Post-Testa 



Blnet 


PPVT-R 


PPVT-E 


BHI 


HMBT 


87 


103 


87 


104 


102 


78 


S3 


77 


91 


92 


165 


136 


164 


195 


194 


1 


2 


0 


0 


6 


164 


184 


164 


195 


194 






; 

I j 



' i 




TABLE III 



PRETEST MEASS PUR SUBJECTS 
WITH AND WITHOUT POSTESTS 



With Postest 



Without Postest 



N 


BlNET j 


N 


PPVT-R 


N 


PPVT-E 




1S9 

! 


89.98 

1 


176 


84.72 


166 


27.04 




56 


89.43 


35 


87.00 


56 


28.27 



TABLE MEANS AND STANDARD DEVIATION OF PRE AND POST 

PPVT-R, PPVT-E and STANFORD BINET 
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top nunber in each cell is the Mean; the lower is the Standard Deviation. 



TABLE V 



ANALYSIS OF VARIANCE, PEE BINET 






SOURCE 


M 


m 


£ 


Smz (A) 


1 


76*57 


1.07 


Experimental 
Status (B) 


1 


182*16 


2*54 


High /Low 
Blnet (G) 


1 


2*2426*06 


313.316*** 


A X B 


X 


217*14 


3.03 


A X C 


1 


29.96 


0.419 


B X C 


1 


124.73 


1*74 


A X B X C 


1 


87*87 


1*23 


Error 


207 


71.57 


1.00 



?<*001 



o ' 

ERIC 








TABLE VI ANALYSIS OF VARIANCE, BINET GAIN 



SOURCE 


DF 


JK 


F 


Sex (a) 


1 


12.47 


0.134 


Exper.lmental 
Status (b) 


1 


•« 

55.84 


0.601 


High/Low 
Blnet (C) 


1 


1948.07 


20. 957 * 


A X B 


1 


23.77 


0*256 


A X C 


1 


0.895 


0.010 


B X G 


1 


18.39 


0.198 


A X B X C 


1 


139.17 


1.497 


Error 


153 


92.96 


1.000 



*** p <; .001 











TABEE VII. ANALYSIS OP VARIANCE, PRE PPVT-R 



mcE 

: (A)- 

>erlmental 
Status (B) 

jh/Low j 

Inet (C) ! 

1C B j 

1C c 

K C 

X B X C j 
bor 

■ 

I • 

^ P ,^01 



DF 

1 

1 

1 

1 

1 

1 

1 

194 






MS 

670.369 

68.578 



7OOB.75O 

51.098 

3.980 

74.930 

13.064 

217.037 



F 

3.089 . 

0.316 

32.293 *** 
0.235 

0.018 



0.060 

1.000 
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TABLE VIII. 



ANALYSIS OF VARIANCE, PPVT-R GAIN 



SOURCE 


SE 


Jg, 


F 


Sex 


1 


66. 84 


0.270 


Experimental 
Status (B) 


1 


6.82 


0.280 


High/Low 
Blnet (C) 


1 


1.48 


0.006 


A X B 


1 


18.96 


0.077 


A X C 


1 


127.90 


0.516 


B X C 


1 


67.96 


0.274 


A X B X C 


1 


875.32 


3.534 


Error 


163 


247 . 6 ? 


1.000 



TABLE ^ 



ANALYSIS OF VARIANCE PPVT-E (PRE) 



SOURCE 

Sex (A) 

Experiraeiital 
Status (B) 

High/Low 
Biuet (€) 

A 1 B 

A X C 

B X C 

A X B X 

Eraror 



M 


MS 


F 


1 


5426453,875 


5.453 * 


1 


116893.243 


0.117 


1 


7412862.643 


7.449 ** 


1 


75143.802 


0.076 


1 


728691.796 


0*732 


1 


9974.323 


0.010 


1 


737811.485 


0.741 


198 


995082.508 


1.000 



^ P < .05 
P ,01 




TAB 03 X. ANALYSIS OF VARIANCE, PPVT-E GAIN 



SOURCE 








F 


Sex (a) 




1 


59 ^ 1 - 721.99 


0.478 


Experimental 
Status! (B) 


t 

1 


191322.21 


0.154 


High/LoJ 
Binet 1 (C) 

ii 


1 


6649088.81 


5 . 342 * 


A X B 


! 


1 


9832. 51 


0.008 


A X C 




1 


2707371.86 


2.175 


B X C 




1 


357633.15 


0.287 


A X B X 


c 


1 


730438 . 01 


0.587 


Error . 

! 




149 


1244687.94 


1.000 


* p < 


• 05 
















anford-Blnet 



Stanford-Binet, PPVT-R and Pp-VT-E 

PPVT*-R 

< $ I f ' > 

,446 



(Pa?e-Tests) 

PPVT-E 



.315 






TABLE XII* AHALfSIS OF TAMAMOl, 
Metropolitan Reading Readiness. Test : Total Score 



'>WBOE 


DF 


MS 


F 


&x (A) 


1 


1831.247 


6*750 ** 


Kpei*iiwe nt a 1 
Status (B) 


1 


228.870 


0*844 


igh/Lo?y 
Blnet ( 0 ) 


1 


14136.379 ■ 


52*105 **' 


X B 


1 


48.991 


C.lBl 


X 0 


1 


272.296 


1*004 


; X C 


1 


40, 570 


0*150 


X B X C 


1 


158.369 


0 * 5 B 6 


1 

trror 

1 


167 


271,303 


1*000 


r 

U ?<*01 








p 

U-# p <^001 
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Ler|c 

K( — 

■ 
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miM 






iitaiiiiiiiBiiMiiiiMii^^ 



illlilllfiiBiliHhi 









JRCE 

K 



TABLE Xm* ANALYSIS OP VARIANCE^ 
Metropolitan Reading Readiness Test ; Class Rank 



gh/Low 



(C) 



X B 
X C 
X C 

X B X 0 



1 

1 

1 

1 

1 

l8l 



MS 



0.272 

21.333 
0 . 
0.234 
0.394 
0.068 
0.470 



•»? 



*** 



0 . 

45.438 **» 

0.999 

0.499 

0.840 

0.145 

1.000 



►» P< .001 




TABLE XIV. ANALYSIS OP VARIANCE, BEHAVIOR INVENTORY 

FACTOR I 



SOURCE 


m 




P 


* Sex 


|A) 


1 


27.62 


0.300 


Experimental 
Statue, (B) 


1 


133.00 


1.445 


High/Low 
Blnet (C) 


1 


2901 . 68 


31.534* 


A X 


B 


1 


306.21 


3.328 


A X 


1 


65.57 


0.713 


B X 


C 


1 


6.05 


0.066 


A X * 


B X C 


1 


27.60 


0.300 


Error 


197 


92.02 


1.000 



*** P < .001 














TABLE XV. ANALYSIS OF VARIANCE^ BEHAVIOR INVENTORY 

FACTOR II 



SOURCE 


m 


MS 


F 


Sex (a) 

1 


1 


99 


1.351 


Experimental 
Sta|:us (B) 

ii 


1 


13. 93 


0.126 


High/Low 
Blnet (c) 


1 


57.02 


0.514 


A X B 


1 


129.86 


1.17 


A X C 


1 


1 


534. 02 


4.810* 


B C 


1 


1 


270,30 


2.435 


A X i 


l X C 


1 


12.81 


0.115 


Error 


» 


197 


111.01 


l.OOO 



* P < ^05 
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TABLE ro, ANALYSIS OF VARIANOB^ BEHAVIOR INVENTORY 

FACTOR III 



SOURCE 

Sex (a) 

Experfi mental 
Stsitus (B) 

! 

High/Low 
Bluet (C) 

A X B 

A X C 

B X 

A X 

Error 




m 


m 


P 


1 


584.35 


6.83 «* 


1 


99.07 


1.158 


1 


1830.89 


21.40 *** 


1 


17.04 


0.199 


1 


498.37 


5.825 * 


1 


2.17 


0.025 


1 


8.10 


0.095 


207 


85.563 


1.000 



* P < .05 
** P V 7oi 
*001 















TABLE XVII 


ANALYSIS OF VARIANCE, 


Pre Binet Faee Sheet; 


Factor I 


j SOURCE 


DF 




F 


1 Sex (A) 


1 . 


54*155 


0*506 


1 Experimental 
^ Status (B) 


1 


0*192 


0 . 002 V 


High/Low 
Binet (C) 


1 


1129*715 


10.563** 


A X B 


, 1 


3*611 


0*034 


A X C 


1 


23.063 


0*216 


B X C 


1 


1.150 


0.011 


A X B X C 


1 


152.263 


1.424 


Error 


190 


106*948 


1.000 



**!?<. .01 




mdum 



* 
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TABLE XIX • ANALYSIS OF VARIANCE, Pre Bitiet 



Sheets Factor 



jURCE 
j ' (A) 

Kperltoteatal 

Status (B) 

r 

Lgh/Low 
duet (C) 

X B 

X C 

X C 

X B X C 



DF 


MS 


F 


1 


296*073 


1.984 


I 


34.815 


0.233 


1 


851.218 


5.705 


1 


24.297 


0.163 


1 


0.011 


0.000 


.1 


247.864 


1.661 


1 


16.960 


0.114 


190 


149.210 


1.000 



rror 



TABLE XX 



ANALYSIS OF VARIANCE, Pre Bluet Face Sheet; Factor IV 



SOURCE 


DF 


m 


F 


Sex (A) 


1 


288.594 


2.867 


Experitaental 
Status (B) 


1 


1.988 


0.020 


Kigh/Low 
Bluet (C) 


1 


2794.507 


27.765*** 


A X B 


1 


681.283 


6.769** 


A X C 


1 


1.437 


0.014 


B X C 


1 


15.845 


0.157 

<V 


A X B X C 


1 


14.443 


0.143 


Error 


190 


100.649 


1.000 



t 

i 



I ** F < .01 

I- 

I p ^ ^001 



i 

\ 



i 


















TABLE XXI * ANALYSIS OF VARIANCE^ 

Pre Dlscrepancsr Score : Stanford-Binet « PPVT-R 



BF MS F 



1 


1270. 576 


5.791 


1 


51.205 


0,233 


1 

1 


3969.262 

45.56* 


18.092 

V 

0.208 


1 


70.263 


0.320 


1 


4.701 


0.021 


1 


100.456 


0.458 


194 


219.396 


1.000 



f 





r. 



TABLE XXII . ANALYSIS OP VARIANCE, 





Post 


Discrepancy Score : Stanford' 


-Bihet - 


lURCE 


BP 


MS 


F 


■X (A) 


1 


971. 


3.942 


:pei*iniiental 
Status (B) 


1 


1.351 


0.005 


.gh/Low 
Binet (C) 


1 


144.951 


0.588 


X B 


1 


87.854 


C.356 








X C 


1 


3.282 


0.013 


X C 


1 


15.788 


0.064 


X B X C 


1 


1.498 


0.006 


r»ror 


l46 


246.534 


1.000 



P< .05 














PP'VT-R 





TABLE XXIII, ANALYSIS OF VARIANCE, 
Pre Discrepancy Score; PP*VL~E « PPVT-R 



URGE 


DF 


MS- 


F 


x (A) 


1 


26oa03. 421 


0,214 


perimental 
Status (B) 


1 


28C835. 970 


0.233 


jgh/Low 
Binet (c) 


1 


7616119. 177 


6,271 * 


X B 


1 


23348.428 


0,19 


X C 


1 


367568.504 


0,303 


X c 


1 


142279.098 


0.117 


ix B X C 


1 


1384582.394 


1,140 


ror 


190 


1214567.128 


1,000 


P< *05 
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TABLE XXV • ANALYSIS OP VARIANCE 
Metropolitan Reading Readiness Test 

Sex, Experimental Status and Teacher Effects 



SOURCE 


BP 


MS 


F 


Sex ( a ) 


1 


1791.615 


9 . 534 ** 


Experimental 
Status (B) 


1 


18,318 


0,097 


Teachers (C) 


8 


1541.191 


8,202 *•* 


A X B 


1 


88,178- 


0.469 


|a X C 


8 


302,824 


1.611 


B X C 


8 


86.300 


0,459 


A X B X C 


8 


147 , 922 


0,787 


Error 


80 


187.915 


1,000 


** P < .01 








*** P < .001 














TABLE XXVI , ANALYSIS OF VARIANCE 
Metropolitan Reading Readiness Test; 

Sex, Experimental Status and School Effects 



fRCE 

: (A) 

jeri mental 
Status (b) 

lools (C) 

C B 



C 
C 

X B X C 
i*or 






DF 

1 

1 

7 

1 * 

7 

7 

7 

115 



MS 

2486.229 

71.866 

2389.778 

9.944 

242.082 
106.022 
187 . 709 
222.807 



F 

11.159 *** 
0.323 

10.726 *** 
0.045 

1.087 

0.476 

0.842 

1.000 



* p .001 



ERIC 
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TABLE XXVII. ANALYSIS OF VARIANCE, Behavior Inventory I: 
Sex, Experimental Status and Teacher Effects 



fURCE 


zy 


MS 


F 


IX (A) 


1 


372.972 


5.669 * 


cperimental 




62.752 


0.954 


Status (B) 


1 


sachers (C) 


8 


779.926 
* ♦ 


11.855 * 


ix B 


1 


1.659 


0,025 


X C 


8 


108.869 


1.655 


X C 


8 


148.901 


2.263 * 


X B X C 


8 


14.433 


0,219 


?ror 


101 


65.787 


1,000 



P < .05 
►* P < .001 







TABLE XXVIII. ANALYSIS OF VARIANCE, Behavior Inventory II: 
Sex, Experimental Status and Teaoner Effects 



rRCE 

(A) 

jerimental 
Status (b) 

ohers (C) 

C B 

K C 

K 0 

k B X C 
ror 



DP 


MS 


P 

i 


1 


44.307 


0. 885 


1 


2.328 


0.047 


8 


1826.836 


36.493 


1 


0.086 


0.002 


8 


33.269 


0.665 


8 


26.190 


0.523 


8 


5.400 


0.108 


101 


50.060 


1.000 



* P .COl 



O 

“ERIC 














TABLE XXIX . ANALYSIS OF VARIANCE, Behavior Inventory III: 



• 


Sex, 


Experimental Status and 


Teacher Effects 


lURCE 


DF 


MS 


F 


X (A) 


1 


647.751 


n.907 *** 

Z' 


[perimental 
Status (b) 


1 


31.636 


0.582 


sachers (c) 


8. 


715.034 


13.144 *** 


X B 


1 


9.394 


0.173 


X C 


8 


116.043 


2.133 


X C . 


8 


129.956 


2.389 


X B X C 


8 


22.833 


0.420 


rror 


109 


54.399 


1.000 



ft* p< .001 



ERJC 
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XXX . Multiple 
Variable - MRRT 



Regression 
(Total Score) 

Increase in 



ehavlor Inventory III 
?ost Binet IQ 
?re Binet Face Sheet Z1 
?re Binet IQ 
?re Binet Face Sheet IV 
?re Binet Face Sheet I 
pre PFVT-R IQ 
Behavior Inventory IT 
Post Binet Face Sheet IV 
i>re E-R Discrepancy 
?re PPVT-E (T Score) 

Post Binet Face Sheet II 
Post E-Blnet Discrepancy 
Post PPVJ?-E (T Score) 

Post E-R Discrepancy 
Post PPVT-R IQ 
Pre Binet Face Sheet III 
Behavior Inventory I 



Multiple R 


s 


.649 


.421 


.718 


.094 


.731 


.019 


.742 


.015 


.755 


.019 


.763 


.013 


.772 


.013 


.776 


.006 


• 

00 

0 


.005 


.782 


.003 


.803 


.033 


.805 


.003 


.807 


.002 


.812 

> 


.009 


.813 


.001 


.815 


.002 


.815 


.0002 


.815 


.0001 

Total R^= .665 



c 

ERIC 



TABLE XXXI . Multiple Regression 

Criterion Variable - PPVT ~ U Gain Scores 

I Increase in 

riable • Multiple R R^ 



•e E-R Discrepancy 


.568 


.323 


•e Binet-R Discrepancy 


.605 


.043 


»e Binet IQ 


.612 


.007 


:»e Binet Face Sheet III 


.622 


.012 


bhavior Inventory III 


.629 


.009 


imT Class Rank 


.634 


.005 


pe Binet Face Sheet I 


.636 


.003 


pe Binet Face Sheet II 


.640 


.004 


ehavior Inventory II 


.641 


.001 


ehavior Inventory I 


.'642 


.0006 


re. Binet Face Sheet I 


.642 


.0004 






Total R^= .4128 







TABLE XXXII, Multiple regression 

% 

Criterion Variable - Pre Binet-R Discrepancy 



•iable. 



I 






E-R Discrepancy 
PPVT-E 
T Class Rank 
avior Inventory II 



Binet Pac 



e Sheet III 



Binet Facje Sheet I 

; • I • 

Binet Face Sheet IV 
^avior Inventory I 
ihavior Inventory III 



Multiple R 



Incregse in 



408 


.166 


6o4 


.199 


688 


.108 


714 


.036 


728 


.020 


7332 


.007 


740 


.010 


745 


.008 


753 


.011 


Total = 


.568 



APPENDIX 



A Instructions for Administration of PPVT— Exprossive 

i ■ Items and Scoring Criteria for PPVT-Expressive 



- — ^ ^ . 
p 


Table 1 


PPVT-Expressive, Reliability Coefficients 


G 


Table II 


Means and Standard Deviations for Stanford- 
Blnet by Testers 


H 


Table III 


Means and Standard Deviations for PPVT-R 
by Testers 


I 


' 

Table IV 


Means and Standard Deviations for PPVT-E 
by Testers 


J,K 


Table V 


Behavior Inventory: Factor Structure 


L 


Table VI 


Stanford-Blnet Pace Sheet: Factor Structure 


M 




Younger Sample 


N 


Table VII 


Younger Sample; Means, Standard Deviations 
and Mean Discrepancy Scores for Stanford-Binet 
PPVT-R and PPVT-E 






PEABODY PICTURE VOCABULARY 



i 

EXPRESSIVE FORM 



1. R«cord th« child's niM, cod* nuid>cr, your naat, d*t* sod school 
oa th* answer fom. 

2. Follow tho general adnlnlstratlon instructions outlined in the nsnaal 
(page 6), itew 1~6. All children, regardless of age, are given all 
the iteM indicated on the answer sheet. 

3. Begin the test by saying: I want to play a pict ure gene with yw^ 

Turn to Exanple A, and say: See all the pictures on this PMS, I 

ifill point to a Picture and I want you to tell m what it is> 

Let's try one. Point to the picture of the bed. I^en the subject 
makes the desired response, say: That's a good answer and turn to 

ExaiVle B. On Exaeple B, point to butterfly and ask: Mow whet 

is this? Do the sane for Exaeple C. 

4. Turn to Plate 25, but before showing it to the diild, say: Mew 

1 an toinx to show you sons other pictures. Eadi t ine 1 will 
point to a picture and I want you to tell ns what it is . When w 
set further alone, there nay be some pictures you are net sure of, 
but I went you to try to tell ne about it anyway. 

5. Show the child Plate 25, and say: All ri^t, let's try this one. 
Point to ''cone,'' and record the child's response ycrbatln. 

Administer all itens, through nu«d>er 60, following the procedure 
outlined above. Mote: Plates 33, 35, 38, 44, 48, 54, and 56 are 

not shown to the subjects. 

6. On itens which haye "probes" indicated, administer the probe for 
any response whidi cannot clearly be scored as 2. Ask the probe 
exactly as stated on the answer for. Where more than one probe 
is given, ask one or the other probe but not both. If a probe is 
asked, write "(Q)" preceding the subject's response. 



o 

ERIC- 


















m 






PPVT-Expresslve Test 
Items and Scoring Criteria 



CONE 



2 - Cone plus ice cream. 



1 - Cone or ice cream. 

ENGINEER, Probe. "What kind of man" 



2 




truck driver. 




1 ~ Man and vehicle mentioned but driving function not specified. 
Describes man’s role as function of uniform, i.e., milkman 
r. PEEKING, Probe; What is she doing 




1 - Girl in some other activity, i.e., climbing the tree, holding 
the tree. Boy plus hiding or peeking. 



B. KITE, Probe: What does it do 
2 - Kite plus activity (fly) 

1 - Kite or activity. 



1 - Other mammals similar in configuration, I.e., squirrel, rabbit 



10, TIME, Probe: What does it do or what Is it for. 

2 ~ Clock or Watch plus tells time. 

Time as a sequential event. 

I 1 - Clock or watch. ^ 

51 . SAIL, Probe: "What kind of , . — 

2 - Boat plus sail (or wind blown). Sailor boat. 



;9. RAT 

2 - Rat, mouse (hampster, guinea pig) 



raccon. 



1 - Boat. Ship. 



B 




32. AMBULANCE. Probe: ”What kind of truck (or car)*' 



Car'^or^truck plus function, ??o?^^do^or^s^trLk'^^^ 

i.e., car to take sick people to the hospital, doctor s > 

hospital car. 



1 - Car or truck with emergency but non-medical function, i.e., 
firetruck, police car. 



34. SKIING. Probe: "What is he doing" 



2 - Boy (or girl) plus winter snow activity, i.e., skiing, sledding, 
sliding down the snow. 

Skiing. 



36 . TWEEZER. Probe: "What is it called" 



2 - Tweezers. Any gripping tool, i.e., pliers. 



1 - Function dexcrlbed without specific mention of the tool, i.e., 
gets splinters out, takes out teeth. 



37 . WASP. 



2 - Response must specifically indicate a flying . 

stinging function or mention one of these specific insec s 

wasp, bee, fly, hornet, mosquito. 



1 - Flying insects, i.e., butterfly, moth . 

Insect or bug (spider or ant not acceptable;. 



39. 



40. 



PARACHUTE. Probe: "What is it called" or "What is it doing 

2 - Parachute plus activity, i.e., carrying, floating, coming down. 

1 - Parachute or activity. 

SADDLE. Probe: "What is it called." 



^ “ Ascription of function (horse and seat), i.e., horse thing 
to sit on. 



1 - Response indicating relationship to horse, 

goes on horse, cowboy sits on it and rides, horse thing. 



41. TEMPEMTURE. Probe: "What is it called" 



2 - Thermometer. a 

Tells or measures temperature, heat or cola. 



1 - other winter activities, i.e., ice skatl^, down the hill, 

playing in the snow, with or without mention of the boy. 



1 - 



Temperature. , fineelflcally mention thermometej 

your coat on. ^ 










CAPTAIN. Probe: "What kind of man" 

2 “ Captain. 

Man related to uniform, te., policeman, mailman. 

1 - Man. 

WHALE. Probe: "What kind" 

2 - Whale. Porpoise. 

1 - Fish. Flipper. Seal. 

BALANCING, Probe: "What Is he doing" or "What Is he called" 

2 ~ Response must mention seal and ball related In an effort 

activity, l.e., balancing, holding, bouncing, playing with 
the ball. 

1 ~ Seal (or similar animal, l.e., walr<i>s) and ball related but 

effort activity not explicit, l.e., seal with a ball. 

Animal (unspecified or wrong animal) In activity with the ball, 
l.e., animal bouncing the ball on his nose, skunk playing with 
the ball, l.e.. Correct activity related to the ball without 
mention of the animal, l.e., turning the ball around on his nose 

COBWEB, Probe. "What Is It called" 

2 - Cobweb. Spider web, net, or house. Web. 

1 - Umbrella. Spider thing. Insect house or equivalent, l.e., bug 

lives here, (spider alone not acceptable) . 

Function described. 

PLEDGING, Probe: "What Is she doing" 

2 - Girl, plus activity toward the flag, l.e., girl saying prayers to 

the flag, girl pledging allegiance. 

If the girl not mentioned, response must state "saying pledge to 
the flag" or "pledging allegiance". 

1 - Girl plus activity not related to the flag, l.e., girl saying 

prayers. Girl and flag mentioned but unrelated. 

Description of activity, l.e., putting her hand over her heart. 

HYDRANT. Probe: "What Is It called; or "What does It do" 

2 - Hydrant (water pump or water thing) plus function, l.e., water 

tank to spray out fire, thing fireman puts hose to for water, 
water pipe to clean streets. 

1 - Hydrant, water thing or object but limited to water source, 
l.e., for water to come out, that sprays water. 

Description of function, l.e., for cleaning the streets, puts 
out fire, squirts water In the summer. 

D 



50. BINOCULAR. ' Probe: "What does it do" 

2 - Binoculars. Response specifying magnification function. 

1 - Glasses. Things to look through. Thing to see. pictures in. 



51 . 



52. 



53 . 



58 . 



o 

; ERIC 



LOCOMOTIVE. 

2 - Locomotive. Train. Engine. 



1 - Choo-choo 

HIVE. Probe: "What is it doing" 



60 . 



2 - Flying insects with box or hive in a related activity, i.e., Uve 
there, make honey, going home. (Response must indicate functional 
relationship between hive and bees.) 



1 - Beed, and-or hive. 

REEL, probe: "What is it for" or "What is it called" 



2 - Reel. Functional description of line as used in fishing, l.e., 
something to pull in the fish. 



1 - Functional description of reel but not related to fishing. 
Response hiving to do with fishing rod. -Pencil sharpener. 



55. GNAWING. 



Probe: "What is he doing" or "What is he called" 



2 - Beaver in activity with the tree, l.e., eating, cutting, chewing, 
chopping or getting tree down. 



1 - Correct activity but animal is wrong or not specified, i.e., 

gQu.li*r©l chewing down the tree, animal eating the tree, gnawing 
the tree. Beaver mentioned but activity incorrect or not related 
to the tree (beaver Jumping in the tree) 



57. BANNISTER. 



2 •• Bsnnist©!?# 

Steps plus rail or pole or functional description of rail, 
i.e., thing you hold on to going up stairs. 



1 - Steps or stairs. 

IDOL. Probe: "What kind of man (or lady) 



2 - Idol. Statue. Non-living representation of a person. 
1 - Indian. Genie. King, Queon, (God or Jesus, angel) 



59. GLOBE. 



2 - Globe. Round world or map 

1 - Ball-map. Earth. Moon. World. Map. . . 

Functional description, i.e., thing that goes around and tells 

you where you are. 



WALRUS 

(g - Walrus. Seallon. Seal 




E 



1 - Flipper. Porpoise. 
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TABLE I PPVT-Expresaive 

Relitibilitv Coefficients for Pre & Po st Tast Scorti 

Split-half Kpdsr-Richardson 

Pre-test r*.78 

Post-test *77 











TABLE II MEAN STANFORD-BINET IQ*» by TKatER 



Prt-test 



Tester 

02 

04 

05 

06 
08 
09 . 

n 

12 



ei 

02 

03 

04 

05 

06 
07 
10 



N 

13 

68 

5^ 

12 

5 

39 

16 

8 



7 

56 

6 

17 

43 

17 

8 
9 



Post-test 



Mean 

84.08 

91.13 

94.52 

93.25 

77.80 

86.82 

87.75 

77.88 



102 

89.27 

85.17 

9^.88 

91.93 

9^.00 

88.25 

90.56 



Standard P«Ti»tioa 

9.51 

10.50 

15.64 

7.80 

3.12 

15.20 

9.58 

10.87 



12.60 

12.67 

9.19 

13.69 

15.71 

9.91 

7.79 

10.80 




G 



frt*r»wittaiMtaMa<<lteiillrtttiilttaaaaaMa^ 
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TABLE III 



MEAN PPVT-R IQ«s by TESTER 
Fre-test 



Terter 



20 

21 

22 

25 

26 
27 
30 



23 

24 
28 
29 
31 



N 



Mean 



Standard PeTiation 



36 

15 
101 

8 

31 

16 
4 



82.47 

93.87 

83.60 

8X.63 

85.71 

91.00 

92.00 



12.70 

9.76 

16.69 

13.07 

18.51 

U.85 

4.58 



Post-test 



28 

63 

47 

19 

27 



83.75 

9^.92 

95.'*7 

89.42 

83.70 



18.19 

13.42 

15.16 

10.85 

15.49 



H 
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TABLE IV MEAN PPVT-E RAW SCORES by TESTER 

Pre-test 



Tester 


N 




Mean 


Standard BsTiation 


4l 


27 




26.59 


5.44 


42 


35 




27.51 


6.91 


43 


6 




22.67 


1.60 


44 


85 




30.12 


5.07 


45 


50 




26.78 


5.33 


46 


15 


Post-test 


17.73 


4.84 


40 


49 




28.31 


5.24 


41 


14 




26.14 


3.42 


43 


30 




29.83 


5.73 


44 


65 




33.28 


6.06 


45 


6 




35.67 


7.27 




TABLE V 



Msvior Inventory 
Factor Structure 

Factor I ; Withdrawing, Inhibited, Caaper Milquetoaat 
(Sharad Variance«34.67> 

Iten 

22. la constricted, inhibited, or tinid; needs to be 

urged before engaging in activities. -.826 

24. Is reluctant to talk to adults; responds verbally 

only when urged. -.811 

7. Often keeps aloof fron others because he is uninterested 

suspicious, or bashful. -.772 

49. Approaches new tasks timidly and without 

assurance; shrinks from trying new things. -.764 

46. Is lethargic or apathetic; has little energy or drive. -.728 

35. Is eager to inform other ctiildren of the experiences 

he has had. +.684 

39. Asks many questions for information about things, 
persons, etc. {Emphasis here should be on questions' 
prompted by genuine curio<iity rather than bids for +.678 

attention. 

5. Talks eagerly to adults about his own experiences and 

what he thinks. +.678 

34. Often will not engage in activities unless strongly 

encouraged. -.677 

1. Is usually carefree; rarely becomes frightened or 

apprehensive. +.595 

33. Likes to talk with or socialise with the teadier. +.576 

28. When faced with a difficult task, he either does 

not attempt it or gives up very quickly. -.565 



J 



Fac tor II: Hoatlla* Iwpulaiva, Loir controlled, Donald Itack 

(Shared Variance«10.52) 

Item 

36. Enotional responae ie customarily very strong; 
over^’responds to usual classroom problems, frustrations, 
and difficulties. 

19. Is excessive In seeking the attention of adults. 

18. Responds to frustration or disappointment by becoming 
aggressive or enraged. 

16. Has little respect for the rights of other children; 

refuses to wait his turn, usurps toys other children are 
playing with, etc. 

26. Is often quarrelsome with classmates for minor reasons. 

10. Is jealous; quick to notice and react negatively 
to kindness and at tent 1cm bestowed upon other 
children. 

37. Is uncooperative In group activities 

23. Is even-tempered. Imperturbable; Is rarely annoyed 
or cross . 

Factor 111; Mon-perslstence In Problem Solving 
(Shared varlance«4.98) 

11. Is methodical and careful In the tasks that he 
undertakes • 

20. Stlc^ with a job until It Is finished. 

40. Usually does what adults ask him to do. 

45. Is wanted as a playmate by other dilldren. 

13. Tries to figure out things for himself before asking 
adults or other children for help. 

2. Is sympathetic, considerate, and thoughtful toward 
others. 

25. Works earnestly at his classwork or play; does not 
take It ll^tly. 



•►.686 

•►.647 

•ii636 

•►.615 

+.601 

+.593 

+.570 



-.568 



+.708 

+.642 

+.631 

+.631 

+.586 

+.576 

+.562 



K 
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TABLE VI 



Stanford-Biuet FACE SHEET 
FACTOR STIBJCTUBE 

Factor I* Eaotional* 

■■■■ " ■ I ■ , I ■ ■ ■ I .P ^ 

Eaotional Indapendanca; aalf confldanca; 
naada Uttia ago aupport for problaa aolving. 

8» Aaaurad 

i 

6. Kaallatically aalf-confidant 

7. Comfortable In adult company 

13. Naada minimum of commendation 

Factor 11. Problem Solving . 

Problem aolving paraiatanca; raaponaa to 
diallanga of problem; taafc oriented 
(intaxmadiata riak taker). 

11. Eager to continue 

12. Challenged by hard taaka 

9. Paraiatant 

10. Baacta to failure raaliatically 

Factor III. Social . 

Initiataa activity; quick to raapond; 
aocially confidant. 

3. Initiataa activity 
5. Socially confidant 

4. Quick to raapond 

Factor IV. Attention . 

Abaorbed by taak; normal activity 
level; attention apacificity. 



Loadini 



.840 

.731 

.623 

.567 



.797 

.734 

.701 

.576 



.783 

.642 

.640 



.693 



2. Normal activity level 
1. Abaorbed by taak 



.686 



APPENDIX 



Younf^er Sample : 




A total of 79 chll-dren of the 90 who were eligible were found 
and tested. The purpose in selecting a population who experienced 
a summer Head Start program, who did not attend kindergarten or 
another pr 3 -school program the following year, and who might return 
the subsequent summer for a second Head Start experience before 
entering kindergarten, was to study the effect of multiple summer 
programs. This population represents the first stage of a longtl- 
tudlnal study and is. reported here In the form of a progress report. 
Table VII (Appendix) summarizes the several measures taken of these 
children in the late fall after their first Head Start experience. 

Of Interest here is the rough comparability of several of the 
measures betwee.n this group and the older children. Means for the 
several groups are the same, and all differences between boys and, 
girls in the younger group are in the same direction as in the 
older group. The discrepancy between the Blnet and PPVT-R in the 
high and low Blnet younger children corresponds to this discrepancy 
in the older children. The only inconsistency Is in the PPVT-E and 
PPVT-R discrepancy. The younger population has no discrepancy 
between these measures above or below the median Blnet IQ. It should 
be recalled that for the older children, those below the median 
Binet were superior on the E, whereas those above the median Blnet 
were superior on the R. These differences may reflect the age 
differences in the populations and will be examined further as 
subsequent data become available. 
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TABLE. VII. MEANS, STANDARD DEVIATION AND DISCREPANCY 

SCORES PPVT-R, PPVT-E, STANFORD BINET 
(Younger Sample) 
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